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SPECIFICATION 



Electronic Version 1.2.8 
Stylesheet Version 1 .0 

[PIPELINED LOW COMPLEXITY 
FFT/IFFT PROCESSOR] 

Background of Invention 

[0001 ] 1 . Field of the Invention 

[0002] ji^g present invention relates to signal processors. More specifically, a radix-2 ^ 
Inverse Fast Fourier Transform (IFFT) processor Is disclosed. 

[0003] 2. Description of the Prior Art 

[0004] For Orthogonal Frequency Division Multiplexing (OFDM) systems, Inverse Fast 
Fourier Transform/Fast Fourier Transform (IFFT/FFT) processors are generally 
in the modulation/demodulation process to achieve effective multicarrier 
transmissions. Many OFDM systems, such as the OFDM system used by the WLAN 
802.1 la standard, require IFFT/FFT processors that provide high speed, real-time 
throughput in combination with a low complexity implementation to obtain high data 
rates. Meeting these criteria is an on-going objective. 

[0005] E.H. Worl and A.M. Despain in their article "Pipeline and Parallel-pipeline FFT 

Processors for VLSI Implementation" from IEEE Trans. Comput., C-33(5): 414-426 of 
May 1984, included herein by reference, describe a radix-2 pipelined Single-path 
Delay Feedback (R2SDF) FFT that is capable of providing high-speed, real-time 
processing. However, such a design requires (log ^ N- 1) complex multipliers for an 
N-point FFT, which Implies a relatively complex Implementation. 

[0006] Shousheng He and Mats Torkelsson disclose in their United States patent 

6,098,088, which is included herein by reference, a radlx-2 ^ Decimation-ln- 
Frequency (DIF) FFT algorithm and associated architecture that lowers the required 

complexity by bringing the number of required complex multipliers down to (log N- 

4 

APP_ID=1 0065 154 Page 1 of 55 



m/e^a€i s; :i s h- . o ^ 3 a s 



1) for an N-point FFT. Additionally, Shousheng He and Torkelson, M. also disclose in 
their article, "A new approach to pipeline FFT processor" in Parallel Processing 

th 

Symposium, 1 996, Proceedings of IPPS "96, The 1 0 International, 1 996, included 

3 

herein by reference, a radix-2 DIF FFT algorithm that requires only (log N-1) 
complex multipliers. However, no architecture related to this algorithm is disclosed. 

[0007] Beyond the demands of low complexity and high speeds, IFFT/FFT processors 
suffer from disorder in the output or input streams. DIF FFT processors and DIT 
(Decimation In Time) IFFT processors provide ordered inputs, but disordered outputs. 
DIT FFT processors and DIF IFFT processors, on the other hand, provide unordered 
Inputs and ordered outputs. For example, a 16-point DIF processor, as disclosed in US 
patent number 6,098,088, sequentially clocks in as input points x[0] to x[l 5]. These 
points are input in order. The output frequency values X[0] to X[l 5], however, are not 
clocked out in order. Instead, they are presented in sequence as: X[0], X[8], X[4], X 
[1 2], X[2], X[l 0], X[6], X[l 4], X[l ], X[9], X[5], X[l 3], X[3], X[l 1], X[7] and finally X[l 5]. A 
DIT FFT processor simply accepts disordered inputs to provide ordered outputs. In 
either case, the lack of order on either of the input or output sides imposes additional 
burdens on circuitry that utilizes the IFFT/FFT processor. 

Summary of Invention 

[0008] It is therefore a primary objective of this Invention to provide an architecture that 

3 

implements a radix-2 algorithm for an IFFT/FFT N-poInt processor. The architecture 

requires only (log N -1) complex multipliers, 2 x log N tt /2 complex rotators, 
8 8 

and log N tt /4 complex rotators. 
8 

[0009] It is a further objective to provide a real-time architecture that utilizes a triplet 

butterfly circuit that includes a butterfly I circuit, a butterfly II circuit and a butterfly III 
circuit. Each of these butterfly circuits has a relatively simple architecture that is 
controlled according to a pipeline step-count of the processor control circuitry. 

It is yet another objective to provide an IFFT/FFT processor with a reordering 
circuit so that both the inputs and the outputs of the IFFT/FFT processor are ordered 
in time. 

Briefly summarized, the preferred embodiment of the present invention discloses 



[0010] 



[001 1] 
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a real-time pipelined N-point transform processor that contains a first butterfly triplet 
multipllcatively connected to an output portion by way of a complex multiplier. The 
butterfly triplet contains a first butterfly I unit (BFI), a butterfly II unit (BFII) and a 
butterfly III unit (BFIII), which are connected together in series. An input port of the 
first BFI serves as an input port of the triplet to accept complex numbers, and an 
output port of the BFIII serves as an output port of the triplet. The complex multiplier 
accepts a complex result from the output port of the first triplet, and a coefficient 
provided by a control unit to generate a complex product. The output portion contains 
at least a second BFI, an input port of the second BFI accepting the complex product 
from the complex multiplier, and the output portion then provides the transformed 
complex numbers. The control unit contains a pipeline step-count register, and the 
ability to provide the coefficients to the complex multiplier. The control unit controls 
each BFI, each BFII, each BFIII, and provides each coefficient, according to a value held 
in the pipeline step-count register. A reordering circuit Is provided to insure that the 
time domain order of the transformed complex numbers matches the frequency 
domain order of the input complex numbers. 

[0012] It is an advantage of the present invention that the butterfly units BFI, BFII and BFIII 

that make up the butterfly triplet and output portion are easy to implement. Further, 

the present invention reduces the number of complex multipliers down to an order of 

(log N- 1). Yet another advantage is that the reordering circuit ensures that the 
8 

output transformed complex numbers occur In the order as provided by the input 
complex numbers. Hence, circuitry utilizing the present invention processor does not 



need to reorder the time or frequency domain, thus reducing implementation burdens 



on external circuitry. 



[0013] 



These and other objectives of the present invention will no doubt become obvious 
to those of ordinary skill in the art after reading the following detailed description of 



the preferred embodiment, which is illustrated in the various figures and drawings. 



Brief Description of Drawings 



[0014] 



Fig.l illustrates a process diagram for a general butterfly circuit. 



[0015] 



Fig. 2 is a process diagram for a 1 6-point radix-2 Decimation in Time Inverse 
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Fast Fourier Transform (DIT IFFT) process according to tlie present invention. 

[001 6] F\g3 Is a schematic design for the 1 6-point radix-2 ^ DIT IFFT process of Fig. 2. 

[001 7] Fig.4 is a schematic diagram of a general butterfly unit BFI according to the 
present invention. 

[001 8] Fig. 5 is a schematic drawing of a general butterfly unit BFII according to the 
present invention. 

[001 9] Fig. 6 is a schematic drawing of a general butterfly unit BFlll according to the 
present invention. 

[0020] Fig. 7 is a schematic drawing of a tt /2 complex rotator 400 according to the 
present invention. 

[0021] Fig. 8 is a schematic drawing of a tt /4 complex rotator according to the present 
invention. 

[0022] Fig. 9 is a process diagram for a 32-point radix-2 ^ DIT IFFT process according to 
the present invention. 

[0023] Fig.l 0 is a schematic design for the 32-point radix-2 ^ DIT IFFT process of Fig. 9. 

[0024] Figs.i 1 A and 1 1 B are process diagrams for a 64-point radix-2 ^ DIT IFFT process 
according to the present invention. 

[0025] pjg^ 2 is a schematic design for the 64-point radix-2 ^ DIT IFFT process of 
Figs.i lAand 1 IB. 

[0026] -| 3 j5 ^ schematic design for a 1 28-point radix-2 ^ DIT IFFT processor 

according to the present invention. 

[0027] Fig.l 4 is a simple block diagram of an IFFT/FFT processor according to the 
present invention. 

[0028] J 5 is a block diagram of a 1 6-point radix-2 ^ DIT IFFT processor supporting 

ordered outputs according to the present invention. 

[0029] pjg^ 6 is a block diagram of a 1 6-point radix-2 ^ DIF IFFT processor supporting 
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ordered inputs according to the present invention. 



Detailed Description 



[0030] 



In the following detailed description of the preferred embodiment design, a 



Decimation in Time (DIT) Inverse Fast Fourier Transform (IFFT) circuit Is disclosed, as 
such a circuit utilizes (j) mathematical coefficients rather than (-j) coefficients, and 
thus reduces the overall complexity of the circuit. However, those skilled in the art will 
realize that it is a trivial matter to utilize the teachings of the present invention to 
build other types of related circuits, such as a Decimation in Frequency (DIF) FFT 
design, as the transformation from a DIF design to a DIT design, and from an IFFT to 
an FFT, involves little more than a change of mathematical coefficients and 
conjugation of the inputs/outputs, respectively. An overview of the mathematical 
basis of the present invention is beneficial, as it aids in the understanding of the 
related butterfly circuits and determination of the various coefficients that are 
provided by the processor control circuitry to the complex multiplier(s). An N-point 
Inverse Discrete Fourier Transform (IDFT) has the general formula of: ' 



[0031] 



N 



(Eqn. la) 



[0032] 



In Eqn. la, are position outputs, X[n\ are frequency inputs, 0<n<N, 0<k 
<. N, and: 



[0033] 



(Eqn. lb) 



[0034] 



By recursively applying a radix-8 followed by a radix-2 index map, the DIT version 



Is obtained when substituting the indices of Eqns.T a and 1 b with: 



[0035] 




[0036] 



and 



[0037] 



W = /?! + 2/22 ^ 4f?3 + 
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[0038] 
[0039] 
[0040] 
[0041] 
[0042] 
[0043] 
[0044] 
[0045] 
[0046] 
[0047] 
[0048] 



where: 



0^ k ^ (N/8 - 1), 
4 

0^ Ar^ ^ 1, 



0^/7 ^ (N/8-1), 
4 



0^ n l,and 



0 ^ ^ 1 



The resulting expression is then given by: 



N 



[0049] 
[0050] 
[0051] 



(Eqn.2) 
where: 



[0053] 



N N N 

VI7f^—lV^^ 1 ^ 3 g 3 4A I 2 i KJ 



[0052] 



If we set: 
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7/ 



C4 = IF'^^*"* 



[0054] Then Eqn.2 can be rewritten as: 



[0055] ^[^^ ^ + 4«3 +8M4] = 

AT . 



2 4 



8 



[0056] Butterfly BFI is identified in the above as: 

[•0057] N N N N N N N 

Bn(-k2+-k3 + k4,n,)=X(-k,+-k3+kJ + (-l)^X(^ + ^1.2+^ 



[0058] 
[0059] 



With this, Eqn.2 is then rewritten as: 



x[ni + -f- 4n3 + Sn^] = 

8 1 



[0060] 
[0061] 

[0062] 
[0063] 



Butterfly BFII Is identified in the above as: 



BFn(|^k3 + k,, n^, n^) = [BFI(|^k3 + k^, n,) + ay^^^^^^^ 
Eqn.2 can then be further rewritten as: 



x[nj + 2x1^ + 4n3 + Sn^] = 



k^-o 4 8 



[0064] 
[0065] 



Finally, butterfly BFIII is identified above as: 



APP ID= 10065 154 



Page 7 of 55 



J„ O Oi iS S. J„ S ,„ 01 '91 kii! '3. Ol S 



BFin(k4. n„n2,n3) = [BFn(k4,n,.nj) + W'^'""^"^^'^' BFn(:^ + ^'^3 + k^^n, .n^)] 



[0066] 



By further identifying a term: 



[0067] 



= BFIHx C3 



[0068] 



Eqn.2 can finally be rewritten as: 



[0069] 




(Eqn.3) 



x[n^ + 2/J2 + 4«3 + 8/I4] = 2 



[0070] 



It is noted that Eqn.3 is simply an (N/8)-point IFFT calculation. Hence, the above 



steps can be recursively applied until (N/8 ) — 8, where "p" is the depth of the 
recursion (i.e., how many times the steps are recursively performed). The above 

equations indicate that BFI, BFII and BFIII are serially linked together in order to form a 

butterfly triplet, and that butterfly triplets are multiplicatively linked together by way 

of the appropriate coefficients. The number of such complete butterfly triplets is "p", 

and Is finally determined by the number "N", i.e., the number of points handled by the 

IFFT processor. The output portion of the IFFT will contain at least a portion of a 

butterfly triplet, which is multiplicatively connected to the last complete butterfly 

triplet via appropriate coefficients. That is, the output portion may not contain a full 

set of the constituent butterfly parts BFI, BFII, BFIII. Where N = 2 " , if the value "n mod 
3" is one, then the output portion will contain only BFI, which will be the output port 

of the IFFT. If "n mod 3" is two, then the output portion will contain BFI and BFII in 

series, with BFII being the output port. If "n mod 3" is zero, then the output portion 

will contain the full complement of the butterfly constituent parts BFI, BFII and BFIII, 

with BFIII being the output port. 

[0071] With regards to the above equations, the following is noted. Butterfly BFII contains 
the coefficient ^.y„i+2nj) , which is a tt /2 complex rotator. Butterfly BFIII contains the 

coefficient: 

[0072] 
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8 8 

(Eqn.4) 

[0073] Eqn.4 can be realized by the cascading of a tt /2 complex rotator and a tt /4 

complex rotator. However, the tt /4 complex rotator, as it appears in Eqn.4, can be 
closely approximated by: 

[0074] rr (Eqn.5) 

(^(1 + ^ 1(2-' + 2-^ + 2-^ + 2-*^ + 2-') X (14- J)Y^ 

[0075] Eqn.5 can be quite easily implemented by way of five right shifters, one tt /2 
complex rotator, one 2-to-l complex adder and one 5-to-l complex adder. 

[0076] In the following, the concept of a butterfly circuit is used extensively. Fig.l 

illustrates a process diagram for a general butterfly circuit 10. From an algorithmic 
point of view, the butterfly 1 0 has two inputs 1 1 a and 1 1 b, and two outputs 1 2a and 
1 2b. Both inputs and outputs are for complex numbers, and thus may represent many 
signal lines depending upon the bit-size of the complex numbers. If input 1 la 
a complex number "A", and input 1 1 b accepts a complex number "B". then output 1 2a 
represents the complex number "A+B", and output 1 2b represents the complex 
number "A-B". A butterfly circuit will thus require a complex adder circuit and a 
complex subtractor circuit. 

[0077] 

Please refer to Fig. 2. Fig. 2 is a process diagram 20 for a 16-point radix-2 ^ DIT 
IFFT according to the present invention, as derived from the above equations. The 

butterfly units BFI, BFII and BFIII are indicated, serially linked together in order to form 

a single complete butterfly triplet. An output portion contains a single BFI unit, 

multiplicatively linked to the butterfly triplet. The output of the butterfly triplet, i.e. 

the output from BFIII, is fed into a complex multiplier, indicated by the " ® " symbol. 

Coefficients W'n are also fed into the complex multiplier, and the resulting complex 

product is passed into the output portion BFI. The value of Wn that is fed into the 
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complex multiplier will depend upon pipeline step-count, and is generally given by: 
[0078] Wn = exp(j x 2 tt x n/1 6) 

[0079] In particular, it should be noted that the term W2, which appears intermittently in 
BFIII, Is the TT /4 complex rotator that is approximated by: 

[0080] W'2 ^ 0.7071 + 0.7071 j 

[0081] Please refer to Fig. 3 in conjunction with Fig. 2. Fig. 3 is a schematic design 30 for 

3 

the 1 6-point radix-2 DIT IFFT process of Fig. 2. The circuit 30 includes a complete 
butterfly triplet 37 multiplicatively connected to an output portion 39 by way of a 

complex multiplier 38. The butterfly triplet 37 includes a first butterfly I unit (BFI) 31a, 

a butterfly II unit (BFII) 32, and a butterfly III unit (BFIII) 33. The output portion 39 

4 

contains a single, second, BFI unit 3 1 b (as 1 6 = 2 , and 4 mod 3 = 1 ). A control unit 
36 controls the operations of the BFIs 31 a, 31 b; the BFII 32, the BFIII 33, and provides 

appropriate coefficients to the multiplier 38. The control unit 36 includes a pipeline 
step-count register 36a, which keeps track of the current pipeline step-count, which 
runs from zero to N-1 for an N-point IFFT processor. The control unit 36 controls the 
butterfly triplet 37, the multiplier 38 and the output portion 39 according to the step- 
count register 36a. 

[0082] 

Please refer to Fig. 4 with reference to Figs. 2 and 3. Fig.4 is a schematic diagram of 
a general butterfly unit BFI 100 according to the present invention. The general 
butterfly BFI 1 00 contains a single complex input X ^ (k) 1 01 , and a single complex 
output X ^ (k) 1 02. The process diagram of Fig.l would seem to indicate that BFI 1 00 
should have two inputs and two outputs, however the actual implementation is not so 
restricted. On the contrary, the IFFT 30 has a pipelined architecture, and so inputs are 
not necessarily simultaneously available. However, two inputs 101 can be clocked in at 
two respective times, as indicated by the pipeline step-count value "k" in X ^ (k) 101, 
the value of which is in the step-count register 36a, and at some time later, two 
corresponding outputs 102 can be clocked out at their respective times, as indicated 
by the "k" X ^ (k) 1 02. Hence, there exists no actual conflict between the process 
algorithm, as depicted in Fig.l , and the physical implementation, as depicted in Flg.4. 
BFI 1 00 includes a delay feedback loop implemented with a buffer 1 03. The buffer 
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1 03, a first in first out (FIFO) buffer, holds storage for a predetermined number "L ^ " 
of complex values. The value of "L ^ " is given by: 

[0083] L ^ = N/(2 X 8 P ) 

[0084] The value "p" corresponds to the recursion number described above with respect 
to the mathennatical background, and indicates the butterfly triplet grouping number 
within which BFl 1 00 serves as a butterfly unit, with the first butterfly triplet (that 
accepting the input points) beginning with p=0, the next (sequentially after the first 
triplet) with p=l , etc. The output portion 39 is also given a value for "p", which is one 
greater than the sequentially last butterfly triplet. For example, in BFl 31 a of Fig. 3, the 
value of "p" is zero (BFl 31a being within the first triplet), whereas the value of "p" for 
BFl 31 b is one (which is one greater than the value of "p" for the last, and only, triplet). 
N is the number of points for which the IFFT circuit is designed. In the IFFT 30 of 
Fig.3, N=:16. Hence, BFl 31a has a buffer size "L^ " of 8, and BFl 31b has a buffer 
"L ^ " of 1 . The general BFl 1 00 includes a subtractor 1 04 and an adder 1 05. Control 
lines 1 06a and 1 06b are controlled by the control unit 36, and respectively control the 
selection output of two multiplexers 107a and 107b. Multiplexer 107a accepts as 
input the complex result 1 05a generated by the adder 1 05 and the data 1 03a output 
by the FIFO buffer 1 03, and selects either value 1 03a, 1 05a as the output X ^ (k) 1 02 
according to the control line 1 06a. Multiplexer 1 07b accepts as input the complex 
result 1 04a generated by the subtractor 1 04 and the input data X ^ (k) 1 01 , and 
selects either value 101 , 104a as output 103i according to the control line 106b, 
which output 103i is then fed as Input into the FIFO 103. Hence, FIFO 103 stores 
either results 1 04a from the subtractor 1 04, or input data X ^ (k) 101. The output X ^ 
(k) 1 02 Is either the output 1 03a from the FIFO 1 03, or the result 1 05a from the adder 
105. 

[0085] 

Please refer to Fig. 5 with reference to Figs. 2 and 3. Fig. 5 is a schematic drawing of 
a general butterfly unit BFII 200 according to the present invention. The general 
butterfly BFII 200 is used as the butterfly unit BFII 32, The principle of operation of the 
general BFII unit 200 is very similar to that of the general BFl unit 1 00. However, the 
general BFII 200 further includes a tt /2 complex rotator 208, and related control 
circuitry. The BFII 200 accepts a complex input 201 with each clock cycle, as 
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determined by step-count register 36a, and generates a complex output 202. Input 
201 is received from the output 102 of a general BFI 100. For example, BFII 32 accepts 
as input the output of BFI 31a in the processor circuit 30. FIFO buffer 203 is used to 
implement a delay feedback loop, with a buffer size "L ^ " given as: 

[0086] L2 = N/(4X8P) 

[0087] Again, "p" indicates the butterfly triplet number in which the general BFII 200 is 

located, and "N" is the point size of the IFFT processor. For the example circuit 30, the 

size "L " of FIFO 203 in BFII unit 32 is four (1 6/4 x 8 ^ = 4). The general BFII 200 
also includes a subtracter 204, an adder 205, the tt /2 complex rotator 208, and 

muliplexers 207a, 207b and 207c. Control lines 206a, 206b and 206c. which control 

the selection outputs of their respective MUXes 207a, 207b and 207c, are set by the 

control unit 36 according to the value held within the step-count register 36a. Exactly 

how the control lines 206a, 206b and 206c should be held for the circuit 30 is clearly 

shown in Fig. 2. 

[0088] Please refer to Fig. 6 with reference to Figs. 2 and 3. Fig. 6 is a schematic drawing of 
a general butterfly unit BFIII 300 according to the present invention. The general 
butterfly BFIII 300 is used as the butterfly unit BFIII 33. The principle of operation of 
the general butterfly unit BFIII 300 is very similar to that of the general butterfly unit 
BFII 200. However, the general BFIII 300 further includes a tt /4 complex rotator 308, 
and related control circuitry. The BFIII 300 accepts a complex input 301 with each 
clock cycle, as determined by step-count register 36a, and generates a complex 
output 302. Input 301 is received from the output 202 of a general BFII 200. For 
example, BFIII 33 accepts as input the output of BFII 32 in the processor circuit 30. 
FIFO buffer 303 is used to implement a delay feedback loop, with a buffer size "L ^ " 
given by: 

fOO^^l L^ = N/(8x8P) 



[0090] 



Again, "p" indicates the butterfly triplet number in which the general BFIII 300 is 
located, and "N" is the point size of the IFFT processor. For the example circuit 30, the 

size "L " of FIFO 303 in BFIII unit 33 is two (16/8x8^ = 2). The general BFIII 300 
also includes a subtracter 304, an adder 305, a tt /2 complex rotator 308, the tt /4 
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complex rotator 309, and four muliplexers 307a, 307b, 307c, and 307d. Control lines 
306a, 306b, 306c and 306d, which control the selection outputs of their respective 
MUXes 307a, 307b, 307c and 307d, are set by the control unit 36 according to the 
value held within the step-count register 36a. Exactly how the control lines 306a, 
306b, 306c and 306d should be held for the circuit 30 is clearly shown In Fig. 2. 

[0091] Output 302 from BFIII 33 is fed into the complex multiplier 38, along with a 

coefficient W"[k] provided by the control unit 36 from a coefficient table 36b. As with 
the butterfly control lines, the coefficient W"[k] Is determined by the value held within 
the step-count register 36a (that is, "k" is the step-count value 36a), and is indicated 
in Fig. 2. 

[0092] Finally the complex product output by the complex multiplier 38 is fed as input 

1 01 into BFI 3 1 b. The FIFO 1 03 of BFI 3 1 b is simply one unit in size, and control of the 
selectors is quite straightforward. 

[0093] Taking all of the delays incurred by the feedback loops into account, for the 16- 
point DIT IFFT circuit 30, 1 6 clock cycles after the first input X[0] is provided, the first 
result x[0] is provided as the output. Note, however, that the outputs x[n], which are 
the respective inverse fast Fourier transform of the inputs X[n], are not ordered in 
time, but instead appear sequentially as x[0], x[8], x[4], x[12], x[2], x[10], x[6], x[14], 
x[l], x[9], x[5], x[l 3], x[3], x[l 1 ], x[7] and finally x[l 5], 

[0094] Please refer to Fig. 7. Fig. 7 is a schematic drawing of a tt /2 complex rotator 400 

according to the present invention. The tt /2 complex rotator 400 is to implement the 

TT 12 complex rotator 308 in the general butterfly unit BFIII 300, and to Implement the 

TT 12 complex rotator 208 in the general butterfly unit BFII 200. Any complex number 

X (k) input into the tt /2 complex rotator 400 will have a real part X (k) 401a and 
I IR 

an imaginary part X (k) 401 b. Similarly, the output X ^ (k) from the tt 12 complex 

rotator 400 will have a real part X (k) 402a and an imaginary part X (k) 402b. 

OR Oi 

The output X ^ (k) is given by: X ^ (k) = X ^ (k) x (j). "j" being the square root of 
negative one. To perform a tt /2 complex rotation, the tt /2 complex rotator 400 
simply provides the input real part 401a as the output imaginary part 402b, and 
multiplies the input imaginary part 401 b by (-1) and provides the resulting product as 
the output real part 402a. Multiplying by (-1) is easily performed by the well-known 
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twos-complement procedure. Consequently, the tt /2 complex rotator 400 is very 
easy to implement. 

[0095] Please refer to Fig. 8. Fig. 8 is a schematic drawing of a tt /4 complex rotator 500 
according to the present invention. The tt /4 complex rotator 500 is used to 
implement the tt /4 complex rotator 309 in the general butterfly unit BFIII 300. The 
TT /4 complex rotator 500 is used to implement Eqn.5, accepting an input complex 
number X ^ (k) 501 and generating a corresponding output complex number X ^ (k) 
502 that is given by: 

[0096] X ^ (k) = (2 + 2 + 2 ""^ + 2 + 2 ) X (1 +j) X X I (k) 

[0097] The tt /4 complex rotator 500 includes a tt /2 complex rotator 503, the structure 
of which is indicated in Fig. 7 as the tt /2 complex rotator 400; a 2-to-l complex 
adder 504; five right shifters 505a-505e, and a 5-to-l complex adder 506. For an 
input number X ^ (k) 501 , the tt /2 complex rotator 503 generates as output 503o the 
value X I (k) X j. As input, the complex adder 504 accepts the output 503o and the 
original input X ^ (k) 501 , and thus generates as output 504o the value (1 +j) x X ^ (k). 

Shifter 505a right shifts output 504o by 1 , essentially multiplying output 504o by 2 

^ , and presents this result as output 507a. Shifter 505b right shifts output 504o by 3, 

which is the same as multiplying output 504o by 2 ^ , and presents this result as 
output 507b. Shifter 505c right shifts output 504o by 4, thereby multiplying output 
-4 

504o by 2 , and presents this result as output 507c. Shifter 505d right shifts 

-6 

output 504o by 6, multiplying output 504o by 2 , with the result as output 507d. 
Finally, shifter 505e right shifts output 504o by 8, generating as output 507e the 

-8 

value of 504o multiplied by 2 . The adder 506 accepts as input the complex values 
on lines 507a-507e, adding them together to generate the output value X ^ (k) 502. 

The TT /4 complex rotator 500 is thus shown to be relatively easy to implement, 

requiring only a tt /2 complex rotator 503 (which is also easy to implement), two 

complex adders 504 and 506, and five right shifters 505a to 505e. 

[0098] ji^g methodology used to Implement the present invention 1 6-point DIT IFFT 30 
of Figs. 2 and 3 can be scaled up to higher values N, as may be required, and the 
manner of doing so should be clear to one skilled in the art from the preceding 
discussion, utilizing the BFI 100, BFII 200 and BFIII 300 units with appropriate FIFO 
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sizes. For example, refer to Fig. 9. Fig. 9 is a process diagram for a 32-point radix-2 ^ 
DIT IFFT process according to the present invention, as derived from tlie equations 

previously discussed. Butterfly units BFI, BFII and BFII consistent with the general 

butterfly units BFI 100, BFII 200 and BFIII 300 of Figs.4, 5 and 6, respectively, are 

Indicated. In Fig. 9, the term W"4 is identified as the tt /4 complex rotator. The general 

coefficients W'n are given by Wn = exp(j x 2 tt x n/32). 

[0099] Please refer to Fig.l 0. Fig.l 0 Is a schematic design 600 for the 32-point radix-2 ^ 
DIT IFFT process of Fig. 9. The IFFT 600 clocks in as input 601 32 frequency values X 

[k], where "k" ranges from zero to 31 and is determined by the pipeline step-count 

register 606a within the control unit 606, and generates unordered output points x[n] 

602. The IFFT 600 Includes a butterfly triplet 607 multlplicatively connected to an 

output portion 609 by a complex multiplier 608. In this case, however, the output 

portion 609 includes a butterfly unit BFI 601 b serially connected to a butterfly unit BFII 

602b, as 32 = 3 ^ , and 5 mod 3 = 2. The butterfly unit BFII 602b serves as the output 
terminal of the IFFT circuit 600. All butterfly units BFI 601 a, 601 b; BFII 602a, 602b; 

and BFIII 603 are implemented by the general butterfly units BFI 1 00, BFII 200 and BFIII 

300, with appropriate value substitutions for "p" and "N" to determine the respective 

FIFO buffer sizes. For example, BFI 601a has a FIFO buffer size "L ^ " of 16; BFII 602a 

has a FIFO buffer size "L ^ " of 8, and BFIII has a buffer size "L ^ " of 4. In the output 

portion 609, with "p" equal to one, BFI 601 b has a FIFO buffer size "L ^ " of 2, and BFII 

602b has a buffer size "L " of 1 . 

2 

[01 00] States of the controls 605 for the various MUXes within the butterfly units BFI 
601a, 601b; BFII 602a, 602b; and BFIII 603 are determined by the value held within 
the pipeline step-count register 606a. These states can be determined from the 
process algorithm shown in Fig. 9, taking into account the various delays imposed by 
the butterfly units. General coefficients W'n are stored within a coefficient table 606b 
of the control unit 606, and are provided to the complex multiplier 608 based upon 
the value held within the step-count register 606a. In effect, as with the circuit 30, the 
outputs 605 of the control unit 606, which control the butterfly units 601a, 601b, 
602a, 602b, 603, and which provides complex values to the multiplier 608, are 
determined by a state machine as implemented by the control unit 606, with the 
current state indicated by the step-count register 606a. 
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Figs.l 1 A and 1 1 B are process diagrams for a 64-point radix-2 ^ DIT IFFT process 
according to the present invention. The associated DIT IFFT circuit 700 is shown in 

Fig. 12. Butterfly units BFI, BFII and BFII consistent with the general butterfly units BFI 
100, BFII 200 and BFII! 300 of Figs. 4, 5 and 6, respectively, are indicated. In Figs.l 1 A 
and 1 1 B, the ternn W8 is identified as the tt /4 complex rotator. The general 
coefficients 706b W'n are given by W'n = expO x 2 tt x n/64). The control unit 706 
can be thought of as a state machine, the state of which is determined by the step- 
count register 706a, Control outputs 705 are determined by the state 706a, and are 

consistent with the process algorithm depicted In Figs.l 1 A and 1 1 B. Note that output 

6 

portion 709, with "p" equal to 1 , is actually a complete butterfly triplet, as 64 = 2 , 
and 6 mod 3 = 0. 

As a final example, a 128-pomt radix-2 ^ DIT IFFT processor 800 according to the 
present invention is depicted in Fig.l 3. The output portion 809 includes a single BFI 

unit 801, as 128 2 , and 7 mod 3 = 1. The circuit 800 further includes two 
butterfly triplets 807a and 807b, with "p" values of zero and one, respectively. Output 

portion 809 thus has a "p" value of two. Butterfly triplet 807a is multiplicatlvely 

connected to butterfly triplet 807b by way of complex multiplier 808a. Butterfly triplet 

807b is multiplicatlvely connected to output portion 809 by way of complex multiplier 

808b. Coefficients W"l [k] and W"2[k] are respectively provided to the complex 

multipliers 808a and 808b from a coefficient table 806b according to the value held in 

the pipeline step-count register 806a. Determining the coefficients 806b, and the 

outputs 805 provided by the control unit 806 according to the step-count register 

806a, should be clear from the above disclosure to one skilled in the art. 

[01 03] Fig.l 4 is a simple block diagram of an IFFT/FFT processor 900 according to the 
present invention. When switches 901 are set to select complex conjugate circuitry 
902, the processor 900 serves as a DIT FFT processor, accepting position inputs l[x] 
and generating corresponding (but unordered) frequency outputs 0[x]. When switches 
901 are set to bypass the complex conjugate circuits 902, the processor 900 serves as 
a DIT IFFT, accepting frequency inputs l[x] and generating corresponding (but 
unordered) position outputs 0[x]. Each complex conjugate circuit 902 simply accepts 
an input complex value and outputs the complex conjugate of that input value. 
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[01 04] Regardless of the type of processor implemented, be it IFFT or FFT, the processor 
suffers from the fact that the output sequencing does not correspond to the input 
sequencing. This is true of both DIT and DIF processors. To correlate an input 
sequence with its corresponding output sequence, a reordering procedure must be 
performed. It would be desirable to have the sequencing of the Inputs match that of 
the outputs, and this is typically done byway of additional buffer memory. For an N- 
point real-time processor, two buffers each containing N complex number slots of 
memory Is typically thought to be required: one buffer to store the data streaming out 
of the processor, and another buffer used to stream out ordered data that has been 
completely received and buffered. However, it is, in fact, possible to use a memory 
that requires only N data slots, while simultaneously supporting and reordering a 
continuous stream of output that exceeds N complex numbers in length. We call this 
"two-phase memory address control". In the following discussion, for the sake of 
consistency with the above disclosure, DIT IFFT processors are considered. However, it 
will be appreciated that the disclosure is equally applicable to DIF FFT, DIF IFFT, or DIT 
FFT processors. 

[0105] 

Please refer to Fig.l 5. Fig.l 5 is a block diagram of a 1 6-point radix-2 DIT IFFT 
processor 1 000 that supports ordered outputs according to the present invention. The 

processor 1 000 contains the 1 6-point radix-2 ^ DIT IFFT unit 30 of Fig. 3, with the 
addition of a reordering circuit 1 100 connected to the output portion 1002 of the IFFT 

unit 30. The 1 6-point radix-2 ^ DIT IFFT unit 30 is used for the sake of convenience 
for a specific example of the present invention N-point reordering circuit. The 

reordering circuit 1 100 comprises as a buffering means a dual-port random access 

memory (RAM) 1 101 that can simultaneously support read and write operations in the 

same clock cycle, as indicated by the pipeline step count register 1 004. The RAM 

1 101 holds space, i.e., memory slots, for N complex numbers, addressable from zero 

to N-1 . As the processor 1 000 is a 1 6-point processor, N is 1 6. The RAM 1 1 01 thus 

has 16 complex number memory address slots, which may be addressed from zero to 

1 5. The reordering circuit 1 1 00 also contains as an address staggering means a latch 

1 101, such as a D-type flip-flop, for buffering a single memory address of the RAM 

1 101. Finally, the reordering circuit 1 100 requires some additions to the control unit 

1006, an address generating means in the form of an address look-up table 1 103, a 
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cycle bit 1 1 04, and any associated circuitry to support the functionality described in 
the following. Designing such additional support circuitry should be clear and obvious 
to one reasonably skilled in the art, and so is not elaborated upon here. 

[01 06] As part of an addressing means, the RAM 1 1 01 has a read address line 1 1 01 r and 
a write address line 1 101 w. A complex number on the output portion 1 002 of the IFFT 
unit 30 Is written into the RAM 1 1 01 at the memory address slot indicated by the write 
address line 1 lOlw. Similarly, the RAM 1101 generates as output 1003 the value 
contained in the memory address slot indicated by the read address lines 1 1 01 r. Such 
operations of the RAM 1 1 01 are familiar to those skilled in the art. The latch 1 1 02 is 
placed across the read address lines 1 1 01 r and the write address lines 1 1 01 w, so that 
the latch 1 102 obtains an address from the read address lines 1 1 01 r, and a next clock 
cycle later (as determined by the pipeline step-count register 1004), provides that 
address to the write address lines 1 1 01 w. The purpose of the latch 1 1 02 is simply to 
stagger the read and write addresses by one clock cycle, as measured by the pipeline 
step-count register 1004. This will be illustrated in more detail below. It is the control 
unit 1 006 that provides the read addresses 1 1 01 r (and by extension the write 
addresses 1 1 01 w) to the RAM 1101, by way of the address look-up table 1 1 03 and 
the cycle bit 1 1 04. The address look-up table 1 1 03 contains a list of addresses for 
addressing the RAM 1 1 01 In the form of entries 1 1 03i I ^ to I ^ ^ , and the cycle bit 
1 1 04 is used to determine the phase for memory addressing. After a complete cycle 
of N clock ticks (determined by the step-count register 1 004, and 1 6 in the present 
example), the cycle bit 11 04 is toggled. When the cycle bit 1 1 04 is set, the control 
unit 1 006 provides addresses 1 1 01 r according to values obtained from the entries 
1 103i in the address look-up table 1 1 03, indexed according the step-count register 
1004. When the cycle bit 11 04 is cleared, the control unit 1006 provides addresses 
1 1 01 r according to the step-count register 1 004. In both phases, the determining 
value used for indexing or addressing is simply one greater than the value held within 
the step-count register 1 004. The cycle bit 1 1 04 toggles (by way of cycle bit toggling 
means, such as a. comparator, bit wise logic, or the like) when the pipeline step-count 
register 1 004 reaches a value of N-1 , in this case, a value of 1 5. 

For the IFFT 30, 1 6 inputs X[0] to X[l 5] are clocked into the circuit 30 sequentially, 
at times T to T , respectively, with corresponding pipeline step-count values of 0 
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to 1 5, respectively. Output values x[0] to x[l 5] first begin appearing at output port 

1 002 at time T , as indicated by Table 1 below: 
1 6 

[01 08] 



Table 1 





Pipeline 




Time 


step- count value 


Output Value 
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vi roi 
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T„ 


12 
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t„~" 


14 




T„ 


15 





[01 09] To support the present invention as regards the IFFT processor 30, the address 
look-up table 1 1 03 has N entries, zero to N-1 , that simply follow the sequential 
ordering of the outputs x[n] as they occur in the time domain as given by the pipeline 
step-count register 1 004. These entries provide ordering decoding information, as 
shown in Table 2 below: 

[0110] 
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Table 2 



LOOK -up cable entr^ 
I* 


RAM Address value 


I. 


0 


li 


e 


I» 


4 


Ii 


12 


I* 
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I, 


10 


Is 


6 




14 


I. 


1 


I, 
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ii. 


5 


Ill 


13 




3 


Il3 


11 


II* 


7 


Its 


IS 



[01 1 1] To understand the operation of the reordering circuit 1 1 00, please refer to the 

following Table 3. Output IFFT output values 1 002 xl [n] correspond to IFFT Input 

values 1 001 from T ^ to T ^ ^ . Output values 1 002 x2[n] correspond to input values 

1 001 from T to T . Output values 1 002 x3[n] correspond to input values 1 001 
16 31 

from T to T 

32 47 

[01 12] 



Table 3 



Time 


Pipeline 
step-count 
value 


Cycle 
bit 


IFFT 

output 


Read 
address 


Urite 
address 
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T„ 
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t„ 


_ 


~~1 




14 


6 


Undefined 


T„ 
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11 
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7 
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. [0113] 
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[0114] 





Pipeline 
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When the cycle bit 1 1 04 is set to one, the control unit 1 006 adds one to the value 

held in the step-count register 1 004, and utilizes the result to index into the address 

look-up table 1 1 03 to obtain a read address. This read address is then provided on 

read address lines 1 1 01 r. The means for performing this action, the generation of a 

first phase address, should be trivial to implement for one of reasonable skill in the 

art. For example, at time T _ the cycle bit 1 1 04 is a one; the pipeline step-count 

1 6 

register 1 004 holds a value of 0; incrementing this value by one obtains an address 
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look-up table 1 103 index of one; entry 1 103i I ^ of the address look-up table 1 103 
contains the RAM memory address value of 8, as shown in Table 2. Hence, the RAM 
read address 1 1 01 r is 8 at time T When the cycle bit 11 04 is cleared, the control 
unit 1 006 sets the read address lines 1 1 01 r to be equal to one greater than the value 
held in the step-count register 1004. Again, the means for generating this second 
type of address, a second phase address, should be trivial to one in the art. In either 
case (i.e., either phase), one clock cycle later, as measured by the step-count register 
1004, the same address provided to the read address lines 1 101 r will be present upon 
write address lines 11 01 w, due to the latch 1 1 02. Data 1 002 is written into the RAM 
1101 at the write address 1 1 01 w, and read from the RAM 1101 as output 1 003 from 
the read address 1 002. When the pipeline step-count register 1 004 reaches a value of 
N-1 , which in this case is 1 5, the cycle bit 11 04 is toggled from zero to one. or one to 
zero, by the cycle bit toggling circuitry. Although an additional delay of N clock cycles 
is incurred, the end result Is that a real-time stream of ordered output values 1 002 
appears at the output 1 003. 

[01 1 6] The above concept of output reordering is actually quite general in nature. A 
stream of input data X[k] in a first local time domain Tl is transformed into a 
corresponding stream of data x[n] in a second local time domain T2 by a processor. 
Each local time domain, in the above example, is marked by a complete cycle of the 
pipeline step-count register 1 004, running from zero to N-1 , i.e., 1 5. Ordering, as 
applied here, means that each data point X[k] and x[n] satisfies the condition that if 
input data X[ p] occurs at time Tl ^ within the first local time domain Tl , where p is a 
number between zero to N-1 , i.e., 1 5, then the corresponding output data x[p] occurs 
at time T2 ^ within the second local time domain T2. Hence, although in the above 
example the inputs were sorted in ascending sequential order from X[0] to X[l 5], this 
is not a necessary condition for the present invention reordering scheme. It would be 
possible, for example, in a suitably designed circuit to provide X[l 5] to X[0] sorted in 
descending sequential order, and obtain at the output of the reordering circuit x[l 5] 
to x[0], again in descending sequential order. The present invention reordering circuit 
simply matches up the local time domains of the inputs with those of the outputs. 

^ Generalizing the above reordering circuit 1 1 00 for N points should be clear from 

the above description. That is, the above can easily be implemented for any value of 
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N, so long as the following condition holds: for unordered data {X , X , ... , X } 

0 1 n 

dispersed over a local time interval T defined by {T ^ , T , ... , T }, for each X 

0 1 n k 

occurring at time T , there occurs at time T . an X .A quick reference shows this to 
j k j 

hold true for Table 1 . For example, xl [8] occurs at pipeline step-count value 1 004 of 
1 , and xl [1 ] occurs at pipeline step-count value 1 004 of 8. A quick perusal of the 
process diagrams of Figs. 9 and 1 1 A, 1 1 B will also show these conditions to hold true. 

[01 1 8] It certainly isn't necessary to restrict the reordering unit of the present invention 
to reordering outputs for a DIT processor; that the present invention can be also 
applied to a DIF FFT processor. Moreover, the reordering circuit can be used on DIT 
FFT and DIF IFFT processors, which require unordered inputs and generate ordered 
outputs. Such an arrangement is shown in Fig.l 6. 

[01 19] The memory used in the above reordering circuits for buffering data should be 

capable of performing both a read and a write operation for each cycle of the pipeline, 
as Indicated by the pipeline step-count register (i.e., for each increment of the value 
held within the pipeline step-count register). This does not mean that a dual-ported 
RAM module is required. Such a design is only the preferred embodiment. It is fully 
possible for other designs that support a standard single-port RAM module. In this 
case, each pipeline operation would require at least two RAM bus cycles, so that read 
write operations could be performed during the same pipeline operation. The read 
and write address ports would also be the same. In one RAM bus cycle, the read 
address as obtained from the control unit would be used. In another write cycle the 
address as obtained from the address latch would be used. 

[01 20] Finally, it should be appreciated that many means may be used to generate an 
address for the first phase of the present invention reordering circuit. That is, an 
address look-up table is not the only means that may be used to generate a first 
phase address. Such addresses may, for example, be calculated. Consider, the 
following table: 

[0121] 
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Table 4 
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[01 22] Table 4 is basically identical to Table 2, but shows entries in binary as well as 

decimal. A look at the right hand column of Table 4 clearly shows that the entries in 
the look-up table are actually nothing more than the "reflection" of their 
corresponding indices. By "reflection", it is meant that the most significant bit (MSB) In 
the original becomes the least significant bit (LSB) in the reflection, the second MSB in 
the original becomes the second 15B in the reflection, and so on. For example, the 
entry at index (0001) has a value of (1 000). The entry at index (1 01 0) has a value of 
(01 01). Such simple bit-wise reflections are easily performed by appropriate logic, and 
can so eliminate the need for a look-up table. For example, in Fig.l 5, the address 
generating means In the IFFT control unit 1006 would include logic to add one to the 
step count register value 1 004 to generate an Intermediate result. Another set of logic 
would include circuitry to perform a bit-wise reflection of this intermediate result to 
generate a first phase address. Finally, a last set of logic would provide the first phase 
address to the read address lines 1 1 01 r when the cycle bit 1 1 04 is a one, and simply 
provide the intermediate result as the second phase address to the read address lines 
1 1 01 r when the cycle bit 1 1 04 is a zero. Further, it should be appreciated that 
addresses, whether first phase or second phase, can be shifted by a base value (that 
is, offset from zero) while still keeping to the spirit of the present invention. 

[0123] 

In contrast to the prior art, the present invention provides a butterfly triplet, which 
is composed of a BFI unit, a BFII unit and a BFIII unit, and an output portion that 
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contains at least a BFI unit, and which is connected to the butterfly triplet by way of a 
complex multiplier. The BFII unit includes a tt /2 complex rotator, and the BFIII 
includes both a tt /2 and a tt /4 complex rotator. All of the BFI, BFII and BFIII units are 
controlled by control circuitry according to a pipeline step-count value, as are the 
coefficients provided to the complex multiplier. In addition, the present invention 
provides a reordering circuit that ensures that the sequence ordering of the inputs 
matches that of the outputs in the time domain. For an N-point real-time processor, 
the reordering circuit requires a buffer memory having only N slots for storing N 
complex numbers. This memory is sufficient to provide real-time streaming ordered 
inputs and outputs that exceeds N points in length, and that is, in fact, of unlimited 
and unbroken length. Read and write access to the reordering buffer memory is 
staggered so that a read at an address in the reordering buffer memory is immediately 
followed by a write to the same address, but one pipeline cycle later. Utilization of an 
address look-up table controls the read address used to fetch from (and hence write 
to) the reordering buffer. The address table is indexed according to a value obtained 
from a pipeline step-count register. 

[01 24] Those skilled in the art will readily observe that numerous modifications and 

alterations of the device may be made while retaining the teachings of the invention. 
Accordingly, the above disclosure should be construed as limited only by the metes 
and bounds of the appended claims. 

[0125] 
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